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Figure 1: Top ten NCBI non redundant protein database hits for INSP081. 

BLAST P 2.2.2 [ Jan-08-2002 ] 

Reference: Altschul, Stephen F., Thomas L . Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

Query= INSP081.pep 
(1132 letters) 

Database: ncbi-nr 

1,039,718 sequences; 328,277,582 total letters 

Searching done 



Score E 

Sequences producing significant alignments: (bits) value 



dbj |BAA87066.1| (AB035267) Nck-interacting kinase-like embryo 


sp. . . 


1223 


0. 


0 


ref |NP_038752. 1| (NM_013724) Nik related kinase [Mus musculus] 


>. . . 


1220 


0. 


0 


ref |XP_125446.1| (XM_125446) similar to Nck-interacting kinase 


-1. . . 


1160 


0. 


0 


ref |XP_109557.1| (XM_109557) similar to Nik related kinase (Mus ... 


747 


0. 


0 


ref |NP_004825. 1 1 (NM_004834) mitogen-activated protein kinase 


ki. . . 


393 


e- 


108 


gb|AAF03788.1|AF172270_l (AF172270) Traf2 and NCK interacting 


ki. . . 


392 


e- 


107 


gb| AAF03786.il AF172268_1 (AF172268) Traf2 and NCK interacting 


ki. . . 


392 


e- 


107 


gb|AAF03789.1|AF172271_l (AF172271) Traf2 and NCK interacting 


ki. . . 


392 


e- 


107 


gb|AAF03785.1|AFl72267_l (AF172267) Traf2 and NCK interacting 


ki . . . 


392 


e- 


107 


dbj |BAA25477.2| (AB011123) KIAA0551 protein [Homo sapiens] 




392 


e- 


107 


gb|AAF03783.1|AF172265_l (AF172265) Traf2 and NCK interacting 


ki. . . 


392 


e- 


107 


gb|AAF03782.1|AFl722 64_l (AF172264) Traf2 and NCK interacting 


ki. . . 


392 


e- 


107 


gbf AAF03787.il AF1722 69_1 (AF172269) Traf2 and NCK interacting 


ki. . . 


392 


e- 


107 
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Figure 2: INSP081 top blastp alignment result from NCBI non-redundant protein 
database sequences. 



>dbj |BAA87066.1| (AB035267) Nck-interacting kinase-like embryo specific kinase [Mus 
musculus] 
Length = 1455 



Score = 1223 bits (3165), Expect =0.0 

Identities = 669/1154 (57%), Positives = 760/1154 (64%), Gaps 



47/1154 (4%) 



Query: 


1 


Sbjct: 


1 


Query: 


61 


Sbjct: 


61 


Query: 


121 


Sbjct: 


121 


Query: 


181 


Sbjct: 


181 


Query: 


241 


Sbjct: 


241 


Query: 


301 


Sbjct: 


301 


Query: 


361 


Sbjct: 


361 


Query: 


421 


Sbjct: 


421 


Query: 


481 


Sbjct: 


477 


Query: 


541 


Sbjct: 


532 


Query : 


601 


Sbjct: 


592 


Query: 


661 


Sbjct: 


652 


Query: 


721 


Sbjct: 


712 


Query: 


781 


Sbjct: 


769 



MAG PGGWRD RE VT DLG HLPDPTGIFSL DKT I GLGT YGRI YLG LH EKTGA FT A V K VMN ARK 60 
MAGPG WRD+EVTDLG LPDPTGI FSLDK IGLGTYGRI+LG+HEKTG+ AVKVM+ARK 
MAG PGS WRDKEVT DLGQL PD PTG I FSLDKAIGLGTYGRIFLGIHEKTGSLVAVKVMSARK 60 

TPLPEIGRRVRVNKYQKSVGWRYSDEEEDLRTELNLLRKYSFHKNIVSFYGAFFKLSPPG 120 
TPLPEIGRRVRVNKYQKSVGWRYSDEEEDLRTELNLLRKYSFHKNIV+FYGAFFKL+PPG 
TPLPEIGRRVRVNKYQKSVGWRYSDEEEDLRTELNLLRKYSFHKNIVTFYGAFFKLNPPG 120 

QRHQLWMVMELCAAGSVTDWRMTSNQSLKEDWIAYICREILQGLAHLHAHRVIHRDIKG 180 
+HQLWMVMELCAAGSVTDWRMT NQSLKEDWIAYICREILQGLAHLHAH+VIHRDIKG 



QNVLLTHNAEVKLVDFGVSAQVSRTNGRRNSFIGTPYWMAPEVIDCDEDPRRSYDYRSDV 240 
QNVLLTH+AEVK+VDFGVSAQVSRTNGRRNSFIGTPYWMAPEVI CDEDPR SYDYRSDV 



WSVGITAIEMAEGAPPLC LQPLEAL VILRE+AP VKSSGWSRKF NFME C IKNFLF 



RPTSANMLQHPFVRDIKNERHWESLTRHLTGIIKKRQKKGIPLIFEREEAIKEQYTVRR 360 
RPTS NML HPFV DIKNER VVESLT+HLTGII+KR+KKGIP+ FE EEA KEQY RR 

RPTSGNMLLHPFVHDIKNERRWESLTKHLTGIIQKREKKGIPVAFEGEEAAKEQYITRR 360 

FRGPSCTHELLRLPTSSRCRPLRVLHGEPSQPRWLPDREEPXXXXXXXXXXXXRVFMPLQ 420 
FRGPSCT ELLR+PTSSRCRPLRVLHGEP QPRWLPD+E+P VFMPL 

FRGPSCTPELLRVPTSSRCRPLRVLHGEPPQPRWLPDQEDPQDQELQQLQKAAGVFMPLH 420 



+ D+ K Q + L+GAA+V MP+ QV+A P Q+ 

SQDNTSKLFPKQVEVAPYLRGAAQWMPVLVQVEA PPQVSKAAQMLRSLPTQDNKA 476 



+V+AP 
TSPEVQAP- 



+P+DLDQVPEEFQGQD+ 
-VAEGQQAQHEALETEQPKDLDQVPEEFQGQDRAPEQPRQGQAAEQQQ 531 



HN VPEQ ++V+AQV LPLLSQ+ HVLL 

IHNPVPEQPPEEDREPEQAEVQEEAVEPPQAEIEDKEPEVVQVHAQVLLPLLSQNRHVLL 591 

PLHLDTQVLIPVEGQTEGSPQAQAWTLEPPQAIGSVQALIEGLSRDLLRAPNSNNSKPLG 660 
PLHLD Q+LIPV Q E P+AQAW LE +A+G+VQALIEGLSRDLLRAPN+ +KPLG 
PLHLDRQLLIPVGEQNEEVPRAQAWDLEASRAVGAVQALIEGLSRDLLRAPNAFVTKPLG 651 

PLQTLMENLSSNRFYSQPEQAREKKSKVSTLRQALAKRLSPKRFRAKSSWRPEKLELSDL 720 
PLQ +ENLS++ FY++PE ++KKSKV++LR+A+AKRL PKRFRAK+ WR E E SD+ 
PLQI FLENLSTDGFYTEPEPTQKKKSKVASLRKAI AKRLRPKRFRAKALWRLEDFEFSDV 711 

EAXXXXXXXXWEDIFNQHEEELRQVXXXXXXXXXXXXXVFHSIQAEVQIEPLKPYISNPK 780 
E WEDI FNQHEE+LR+V VFHSIQAEVQIE P+ +NP 
ETSRRRRHRRWEDIFNQHEEQLRRVENDREDDSSDNDEVFHSIQAEVQIE PHAANPA 768 

K I EVQERS PS VPNNQDHAHH VKFSS RTWHMLFCL FI S V PQRSLLEQAQKPI DI RQRSSQN 840 
EV ERS +P N++ H VKFS L + PQ +Q Q+P++IR R+ N 
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Query: 


841 


Sbjct: 


825 


Query: 


901 


Sbjct: 


885 


Query: 


961 


Sbjct: 


944 


Query: 


1021 


Sbjct : 


1002 


Query: 


1081 


Sbjct: 


1057 


Query: 


1119 


Sbjct: 


1116 



RQNWLAAXXXXXXXXXPVTGRRSQSSPPYSTIDQKLLVDIHVPDGFKVGKISPPVYLTNE 900 

QN+ A PVT R+SQSSPPYSTIDQKLL+DIHVPDGFKVGKISPPVYLTNE 

PQNFQAQSDSSSEEDSPVTRRKSQSSPPYSTIDQKLLIDIHVPDGFKVGKISPPVYLTNE 884 

WVGYNALSEIFRNDWLTPAPVIQPPEEDGDYVELYXXXXXXXXXXXXXXXXXXXXTYDHA 960 
WVGYNALSEIF +DW+ P +PPEEDGDYVELY D A 

WVGYNALSEIFWDDWIMPTRPARPPEEDGDYVELY-DADANANGDEEVANGAYEDPRDGA 943 

NGNDDLDNQVDQANDVCXXXXXXXXXXXXXXXXXYYEAPSCPRASYGRDGSCKQDGYDGS 1020 
NG+DD++NQ+DQAN + A + PRA+Y R G K DG 

NGHDDMNNQLDQANG — YEGHGAAGYNGGDVGGNHGAAFNGPRANYPRAGILKNGHNDGR 1001 



A+ +G + A HG AA E AA G+ H AN NE + 

SAFGVFGDNAARAFHG — AAGEAGAAFGN HG AN RGNGRGN RN REAN GRN EEN G 1056 

DSGAAPSAPDHESD NKDISESSTQSDFSANHSSPSK 1118 

D P ESD N SST F SSP + 



GSGMSADANFASAI 1132 
GS +D ++ S I 
GSDFGSDISYNSPI 1129 
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Figure 3: Top ten NCBI non redundant protein database hits for INSP082. 



BLASTP 2.2.2 [ Jan-08-2002 ] 



Reference: Altschul, Stephen F. , Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 



Query= chrX_101702493-101913965_hgll .pep 
(1500 letters) 

Database: ncbi-nr 

1,039,718 sequences; 328,277,582 total letters 



Searching done 



Sequences producing significant alignments: 

dbj | BAA87066 . 1 1 (AB035267) Nck-interacting kinase-like embryo sp. 
ref |NP_038752.1| (NM_013724) Nik related kinase [Mus musculus) >. 
ref |XP_125446.1| (XM_125446) similar to Nck-interacting kinase-1. 
ref |XP_109557.1| <XM_109557) similar to Nik related kinase [Mus . 
erab | CAB06091 . 1 | (Z83850) mouse NIK serine threonine protein kina. 
ref |NP_004825.1{ (NM_004834) mitogen-activated protein kinase ki. 
gb|AAF03788.1|AF172270_l (AF172270) Traf2 and NCK interacting ki. 
gb|AAF03786.1|AF172268_l (AF172268) Traf2 and NCK interacting ki . 
gb| AAF03789.1|AF172271_1 (AF172271) Traf2 and NCK interacting ki. 
gb|AAF03785.1|AF172267_l (AF172267) Traf2 and NCK interacting ki . 
dbj | BAA25477 . 2 I (AB011123) KIAA0551 protein [Homo sapiens] 
gb|AAF03783.1|AF172265_l (AF172265) Traf2 and NCK interacting ki . 



Score 


E 


(bits) 


value 


1727 


0.0 


1724 


0.0 


1664 


0.0 


1251 


0.0 


599 


e-170 


393 


e-108 


392 


e-107 


392 


e-107 


392 


e-107 


392 


e-107 


392 


e-107 


392 


e-107 
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Figure 4: INSP082 top blastp alignment result from NCBI non-redundant protein 
database sequences. 

>db j I BAA87066 . 1 | (AB035267) Nck-interacting kinase-like embryo specific kinase [Mus 
musculus] 
Length = 1455 

Score = 1727 bits (4473), Expect = 0.0 

Identities « 923/1505 (61%), Positives = 1058/1505 (69%), Gaps = 55/1505 (3%) 



Query: 


1 


Sbjct: 


1 


Query: 


61 


Sbjct: 


61 


Query: 


121 


Sbjct: 


121 


Query: 


181 


Sbjct: 


181 


Query : 


241 


Sbjct: 


241 


Query: 


301 


Sbjct: 


301 


Query: 


361 


Sbjct: 


361 


Query: 


421 


Sbjct: 


421 


Query: 


481 


Sbjct: 


477 


Query: 


541 


Sbjct : 


532 


Query: 


601 


Sbjct: 


592 


Query: 


661 


Sbjct: 


652 


Query: 


721 


Sbjct: 


712 


Query: 


781 


Sbjct: 


769 



MAGPGGWRDREVTDLGHLPDPTGIFSLDKTIGLGTYGRIYLGLHEKTGAFTAVKVMNARK 60 
MAGPG WRD+EVTDLG LPDPTGI FSLDK IGLGTYGRI+LG+HEKTG+ AVKVM+ARK 
MAGPGSWRDKEVTDLGQLPDPTGIFSLDKAIGLGTYGRIFLGIHEKTGSLVAVKVMSARK 60 

TPLPEIGRRVRVNKYQKSVGWRYSDEEEDLRTELNLLRKYSFHKNIVSFYGAFFKLSPPG 120 
TPLPEIGRRVRVNKYQKSVGWRYSDEEEDLRTELNLLRKYSFHKNIV+FYGAFFKL+PPG 
TPLPEIGRRVRVNKYQKSVGWRYSDEEEDLRTELNLLRKYSFHKNIVTFYGAFFKLNPPG 120 

QRHQLWMVMELCAAGSVTDVVRMTSNQSLKEDWIAYICREILQGLAHLHAHRVIHRDIKG 180 

+HQLWMVMELCAAGSVTDWRMT NQSLKEDWIAYICREILQGLAHLHAH+VIHRDIKG 
HQHQLWMVMELCAAGSVTDVVRMTRNQSLKEDWIAYICREILQGLAHLHAHQVIHRDIKG 180 

QNVLLTHNAEVKLVDFGVSAQVSRTNGRRNSFIGTPYWMAPEVIDCDEDPRRSYDYRSDV 240 
QNVLLTH+AEVK+VDFGVSAQVSRTNGRRNSFIGTPYWMAPEVI CDEDPR SYDYRSDV 
QNVLLTHDAEVKIVDFGVSAQVSRTNGRRNSFIGTPYWMAPEVIHCDEDPRCSYDYRSDV 240 

WSVGITAIEMAEGAPPLCNLQPLEALFVILRESAPTVKSSGWSRKFHNFMEKCTIKNFLF 300 
WS VGI TAI EMAEGAP PLC LQPLEAL VILRE+AP VKSSGWSRKF NFME C IKNFLF 
WSVGITAIEMAEGAPPLCKLQPLEALCVILREAAPKVKSSGWSRKFQNFMENCMIKNFLF 300 

RPTSANMLQHPFVRDIKNERHWESLTRHLTGIIKKRQKKGIPLIFEREEAIKEQYTVRR 3 60 
RPTS NML HPFV DIKNER WESLT+HLTGII+KR+KKGIP+ FE EEA KEQY RR 
RPTSGNMLLHPFVHDIKNERRVVESLTKHLTGIIQKREKKGIPVAFEGEEAAKEQYITRR 3 60 

FRGPSCTHELLRLPTSSRCRPLRVLHGEPSQPRWLPDREEPXXXXXXXXXXXXRVFMPLQ 4 20 
FRGPSCT ELLR+PTSSRCRPLRVLHGEP QPRWLPD+E+P VFMPL 
FRGPSCTPELLRVPTSSRCRPLRVLHGEPPQPRWLPDQEDPQDQELQQLQKAAGVFMPLH 420 

ALDSAPKPLKGQAQAPQRLQGAARVFMPLQAQVKAKASKPLQMQIKXXXXXXXXXXXXXX 480 
+ D+ K Q + L+GAA+V MP+ QV+A P Q+ 

SQDNTSKLFPKQVEVAPYLRGAAQWMPVLVQVEA PPQVSKAAQMLRSLPTQDNKA 476 

XQAQVRAPRLLXXXXXXXXXXXXXXXXSEPQDLDQVPEEFQGQDQVXXXXXXXXXXXXXX 540 

+V+AP +P+DLDQVPEEFQGQD+ 
TSPEVQAP VAEGQQAQHEALETEQPKDLDQVPEEFQGQDRAPEQPRQGQAAEQQQ 531 

XHNQVPEQELXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSLRVNAQVFLPLLSQDHHVLL 600 

HN VPEQ ++V+AQV LPLLSQ+ HVLL 

IHNPVPEQPPEEDREPEQAEVQEEAVEPPQAEIEDKEPEVVQVHAQVLLPLLSQNRHVLL 591 

PLHLDTQVLIPVEGQTEGSPQAQAWTLEPPQAIGSVQALIEGLSRDLLRAPNSNNSKPLG 660 
PLHLD Q+LIPV Q E P+AQAW LE +A+G+VQALI EGLSRDLLRAPN+ +KPLG 
PLHLDRQLLIPVGEQNEEVPRAQAWDLEASRAVGAVQALIEGLSRDLLRAPNAFVTKPLG 651 

PLQTLMENLSSNRFYSQPEQAREKKSKVSTLRQALAKRLSPKRFRAKSSWRPEKLELSDL 720 
PLQ +ENLS++ FY++PE ++KKSKV++LR+A+AKRL PKRFRAK+ WR E E SD+ 
PLQIFLENLSTDGFYTEPEPTQKKKSKVASLRKAIAKRLRPKRFRAKALWRLEDFEFSDV 711 

EAXXXXXXXXWEDIFNQHEEELRQVXXXXXXXXXXXXXVFHSIQAEVQIEPLKPYISNPK 780 
E WEDI FNQHEE+LR+V VFHSIQAEVQIE P+ +NP 
ETSRRRRHRRWEDIFNQHEEQLRRVENDREDDSSDNDEVFHSIQAEVQIE PHAANPA 7 68 

KIEVQERSPSVPNNQDHAHHVKFSSRTWHMLFCLFISVPQRSLLEQAQKPIDIRQRSSQN 840 

EV ERS +P N++ H VKFS L + PQ +Q Q+P++IR R+ N 
GNEVHERSAPMPCNRNRTHRVKFSPSVGEEEPSLEEAQPQ QQQQQPMNIRPRNCLN 824 
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Query: 841 RQNWLAAXXXXXXXXXPVTGRRSQSSPPYSTIDQKLLVDIHVPDGFKVGKISPPVYLTNE 900 

QN+ A PVT R+SQSSPPYSTIDQKLL+DIHVPDGFKVGKISPPVYLTNE 

Sbjct: 825 PQNFQAQSDSSSEEDSPVTRRKSQSSPPYSTIDQKLLIDIHVPDGFKVGKISPPVYLTNE 884 

Query: 901 WVGYNALSEIFRNDWLTPAPVIQPPEEDGDYVELYXXXXXXXXXXXXXXXXXXXXTYDHA 960 

WVGYNALSEIF +DW+ P +PPEEDGDYVELY D A 

Sbjct: 885 WVGYNALSEIFWDDWIMPTRPARPPEEDGDYVELY-DADANANGDEEVANGAYEDPRDGA 943 

Query: 961 NGNDDLDNQVDQANDVCXXXXXXXXXXXXXXXXXYYEAPSCPRASYGRDGSCKQDGYDGS 1020 

NG+DD++NQ+DQAN + A + PRA+Y R G K DG 

Sbjct: 944 NGH DDMNNQL DQANG — YEGHGAAGYNGGDVGGNHGAAFNGPRANYPRAGILKNGHNDGR 1001 

Query: 1021 RGKEEAYRGYGSHTANRSHGGSAASEDNAAIGDQEEHAANIXXXXXXXXXXXXXXNEASN 1080 

A+ +G + A HG AA E AA G+ H AN NE + 

Sbjct: 1002 ALNRGAFGVFGDNAARAFHG — AAGEAGAAFGN HGANRGNGRGNRNREANGRNEENG 1056 

Query: 1081 AI — DSGAAPSAPDHESDNKDISESSTQSDFSANHSSPSKGSGMSADANFASAILYAGFV 1138 

AD P ESD + S + ++ + G +S+ GF 
Sbjct: 1057 AFGRDQHVFPEFEHEESDRGTETSDSIALEITSFDGEQNSGRPVSSTT MGF- 1107 

Query: 1139 EVPEESPKQP SEVNVNPLYVSPACKKPLIHMYEKEFTSEICCGSLWGVNLLLGTRSN 1195 

+ SP+ S+++ N P++H+YEK+F+SE+ CGSLWGVNLLLGT+S+ 

Sbjct: 1108 PIGRSSPRGSDFGSDISYN SPILHVYEKDFSSEVYCGSLWGVNLLLGTQSH 1158 

Query: 1196 LYLMDRSGKADITKLIRRRPFRQIQVLEPLNLLITISGHKNRLRVYHLTWLRNKILNNDP 1255 

L YLMDRSGKA+ I KLI+RRPFRQIQV+E LNLLITISG KNRLRVYHL+WLRNKILNNDP 
Sbjct: 1159 LYLMDRSGKAEIVKLIKRRPFRQIQVVEQLNLLITISGKKNRLRVYHLSWLRNKILNNDP 1218 

Query: 1256 ESKRRQEEMLKTEEACKAIDKLTGCEHFSVLQHEETTYIAIALKSSIHLYAWAPKSFDES 1315 

+SK+RQ+ M K EEACKAIDKL GCEHFSVLQHEETTYIA+A+KSSIHL+AWAPKSFDE+ 
Sbjct: 1219 K S K KRQKAMRK K E EAC KA I DKL I GC EH FSVLQHEETTYI A V A V KS S I H L F AW A P K S F D EN 1278 

Query: 1316 TAIKVFPTLDHKPVTVDLAIGSEKRLKIFFSSADGYHLIDAESEVMSDVTLPKXXXXXXX 1375 

TAIKVFPT D KP+TVDLA+GSEK LKI FFSSA+GYH+I DAESEVMS+VTLP 
Sbjct: 1279 TAIKVFPTRDLKPLTVDLAVGSEKTLKIFFSSANGYHIIDAESEVMSEVTLPN 1331 

Query: 1376 XXXXXXXXDCLGIGMMLTFNAEALSVEANEQLFKKILEMWKDIPSSIAFECTQRTTGWGQ 1435 

DCLG+G+ML+ NAEA S EANEQL KKIL++WKDIPSS+AFECT+R TGW Q 
Sbjct: 1332 -NNWILPDCLGLGVMLSLNAEAASEEANEQLLKKILDVWKDIPSSVAFECTKRITGWDQ 1390 

Query: 1436 KAIEVRSLQSRVLESELKRRSIKKLRFLCTRGDKLFFTSTLRNHHSRVYFMTLGKliEELQ 1495 

KAIEVRSLQS +LE+ELKRRS I KKLRFLC RGDK+FF STL N HSRVY M+LGKLEEL 
Sbjct: 1391 KAIEVRSLQSTILENELKRRSIKKLRFLCARGDKMFFASTLSNDHSRVYLMSLGKLEELH 1450 



Query: 1496 SNYDV 1500 
+Y V 

Sbjct: 1451 RSYAV 1455 
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Figure 5: Top ten NCBI non redundant protein database hits for INSP082. 

B LAS TP 2.2.2 [ Jan-08-2002] 



Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

Query= genomic. pep 

(1589 letters) 

Database: ncbi-nr 

1,039,718 sequences; 328,277,582 total letters 



Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



dbj I BAA87066.il (AB035267) Nck-interacting kinase-like embryo sp. . 
ref |NP_038752.1| (NM_013724) Nik related kinase [Mus musculus] >.. 
ref |XP_109557.1| (XM_109557) similar to Nik related kinase [Mus .. 
ref IXP_125446.1| (XM_125446) similar to Nck-interacting kinase-1.. 
emb| CAB06091 . 1 1 (Z83850) mouse NIK serine threonine protein kina. . 
ref | NP__004825 . 1 1 <NM_004834) mitogen-activated protein kinase ki.. 
gb|AAF03788.1|AF172270_l (AF172270) Traf2 and NCK interacting ki . . 
gb|AAF03786.1|AF172268_l (AF172268) Traf2 and NCK interacting ki.. 
gb|AAF03789.1|AF172271_l (AF172271) Traf2 and NCK interacting ki.. 
gb|AAF03785.1|AF172267_l (AF172267) Traf2 and NCK interacting ki.. 
dbj|BAA25477.2| (AB011123) KIAA0551 protein [Homo sapiens] 



1708 
1704 
1232 
1048 
599 
393 
392 
392 
392 
392 
392 



e-170 
e-108 
e-107 
e-107 
e-107 
e-107 
e-107 



0.0 
0.0 
0.0 
0.0 



WO 2004/022756 



PCT/GB2003/003862 



8/14 



Figure 6: INSP091 top blastp alignment result from NCBI non-redundant protein 
database sequences. 



>dbj IBAA87066.il (AB035267) Nck-interacting kinase-like embryo specific kinase [Mus 
musculus] 
Length = 1455 



Score = 1708 bits (4423), Expect = 0.0 
Identities = 935/1590 (58%), Positives 



1070/1590 (66%), Gaps = 136/1590 (8%) 



Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 



1 MAGPGGWRDREVTDLGHLPDPTGIFSLDKTIGLGTYGRIYLGLHEKTGAFTAVKVMNARK 60 

MAGPG WRD+EVTDLG LPDPTGI FSLDK IGLGTYGRI+LG+HEKTG+ AVKVM+ARK 
1 MAGPGSWRDKEVTDLGQLPDPTGIFSLDKAIGLGTYGRI FLGIHEKTGSLVAVKVMSARK 60 

61 TPLPEIGRRVRVNKYQKSVGWRYSDEEEDLRTELNLLRKYSFHKNIVSFYGAFFKLSPPG 120 

TPLPEIGRRVRVNKYQKSVGWRYSDEEEDLRTELNLLRKYSFHKNIV+FYGAFFKL+PPG 
61 TPLPEIGRRVRVNKYQKSVGWRYSDEEEDLRTELNLLRKYSFHKNIVTFYGAFFKLNPPG 120 

121 QRHQLWMVMELC AAG S VT D VV RMT SNQSLKEDWIAYICREI LQGL AHLH AH RV I H RD I KG 180 

+HQLWMVMELC AAGS VT DWRMT NQSLKEDWIAYICREILQGLAHLHAH+VIHRDIKG 
121 HQHQLWMVMELCAAGSVTDVVRMTRNQSLKEDWIAYICREILQGLAHLHAHQVIHRDIKG 180 

181 QNVLLTHNAEVKLVDFGVSAQVSRTNGRRNSFIGTPYWMAPEVIDCDEDPRRSYDYRSDV 24 0 

QNVXLTH+AEVK+VDFGVSAQVSRTNGRRNSFIGTPYWMAPEVI CDEDPR SYDYRSDV 
181 QNVLLTHDAEVKIVDFGVSAQVSRTNGRRNSFIGTPYWMAPEVIHCDEDPRCSYDYRSDV 24 0 

241 WSVGITAIEMAEGAPPLCNLQPLEALFVILRESAPTVKSSGWSRKFHNFMEKCTIKNFLF 300 

WSVGITAIEMAEGAPPLC LQPLEAL VILRE+AP VKSSGWSRKF NFME C IKNFLF 
241 WSVGITAIEMAEGAPPLCKLQPLEALCVILREAAPKVKSSGWSRKFQNFMENCMIKNFLF 300 

301 RPTSANMLQHPFVRDIKNERHVVESLTRHLTGIIKKRQKKGIPLIFEREEAIKEQYTVRR 360 

RPTS NML HPFV DIKNER VVESLT+HLTGI I+KR+KKGIP+ FE EEA KEQY RR 
301 RPTSGNMLLHPFVHDIKNERRVVESLTKHLTGIIQKREKKGIPVAFEGEEAAKEQYITRR 360 

361 FRGPSCTHELLRLPTSSRCRPLRVLHGEPSQPRWLPDREEPXXXXXXXXXXXXRVFMPLQ 420 

FRGPSCT ELLR+PTSSRCRPLRVLHGEP QPRWLPD+E+P VFMPL 
361 FRGPSCTPELLRVPTSSRCRPLRVLHGEPPQPRWLPDQEDPQDQELQQLQKAAGVFMPLH 420 

421 ALDSAPKPLKGQAQAPQRLQGAARVFMPLQAQVKAKAS KPLQMQI KXXXXXXXXXXXXXX 480 

+ D+ K Q + L+GAA+V MP+ QV+A P Q+ 
421 SQDNTSKLFPKQVEVAPYLRGAAQVVMPVLVQVEA PPQVSKAAQMLRSLPTQDNKA 476 

481 XQAQVRAPRLLXXXXXXXXXXXXXXXXSEPQDLDQVPEEFQGQDQVXXXXXXXXXXXXXX 540 

+V+AP +P+DLDQVPEEFQGQD+ 
477 TSPEVQAP VAEGQQAQHEALETEQPKDLDQVPEEFQGQDRAPEQPRQGQAAEQQQ 531 

541 XHNQVPEQELXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSLRVNAQVFLPLLSQDHH VLL 600 

HN VPEQ ++V+AQV LPLLSQ+ HVLL 

532 IHNPVPEQPPEEDREPEQAEVQEEAVEPPQAEIEDKEPEVVQVHAQVLLPLLSQNRHVLL 591 

601 PLHLDTQVLIPVEGQTEGSPQAQAWTLEPPQAIGSVQALIEGLSRDLLRAPNSNNSKPLG 660 
PLHLD Q+LIPV Q E P+AQAW LE +A+G+VQALIEGLSRDLLRAPN+ +KPLG 
PLHLDRQLLIPVGEQNEEVPRAQAWDLEASRAVGAVQALIEGLSRDLLRAPNAFVTKPLG 651 



592 
661 
652 



PLQTLMENLSSNRFYSQPEQAREKKSKVSTLRQALAKRLSPKRFRAKSSWRPEKLELSDL 720 
PLQ +ENLS++ FY++PE ++KKSKV++LR+A+AKRL PKRFRAK+ WR E E SD+ 
PLQIFLENLSTDGFYTEPEPTQKKKSKVASLRKAIAKRLRPKRFRAKALWRLEDFEFSDV 711 



721 EAXXXXXXXXWEDIFNQHEEELRQVXXXXXXXXXXXXXVFHSIQAEVQIEPLKPYISNPK 780 

E WEDIFNQHEE+LR+V VFHSIQAEVQIEP + +NP 
712 ET SRRRRHRRWED I FN QHEEQLRRVENDREDDS SONDE VFHSIQAEVQIEP HAANPA 768 

781 KIEVQERSPSVPNNQDHAHHVKFSSSVPQRS-LLEQAQKPIDIRQRSSQNRQNWLAASGD 839 

EV ERS +P N++ H VKFS SV + LE+AQ 
7 69 GNEVHERSAPMPCNRNRTHRVKFSPSVGEEEPSLEEAQP 807 
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Query: 


840 


Sbjct: 


808 


Query: 


900 


Sbjct: 


840 


Query: 


960 


Sbjct: 


900 


Query: 


1020 


Sbjct: 


959 


Query: 


1080 


Sbjct: 


1017 


Query: 


1140 


Sbjct: 


1052 


Query: 


1200 


Sbjct: 


1109 


Query: 


1260 


Sbjct: 


1134 


Query: 


1320 


Sbjct: 


1194 


Query: 


1380 


Sbjct : 


1254 


Query: 


1440 


Sbjct: 


1314 


Query: 


1500 


Sbjct: 


1366 


Query: 


1560 


Sbjct: 


1426 



+++ Q+ MN + + L P+ +SSSEE+ 

-QQQQQQPMN I RPRNC LNPQN — FQAQSDSSSEED 839 



SPVTGRRSQSSPPYSTIDQKLLVDIHVPDGFKVGKISPPVYLTNEWVGYNALSEIFRNDW 959 
SPVT R+SQSSPPYSTIDQKLL+DIHVPDGFKVGKISPPVYLTNEWVGYNALSEIF +DW 
SPVTRRKSQSSPPYSTIDQKLLIDIHVPDGFKVGKISPPVYLTNEWVGYNALSEIFWDDW 899 

LTPAPVIQPPEEDGDYVELYXXXXXXXXXXXXXXXXXXXXTYDHANGNDDLDNQVDQAND 1019 
+ P +PPEEDGDYVELY D ANG+DD++NQ+DQAN 

IMPTRPARPPEEDGDYVELYDADANANGDEEVANGAYEDPR-DGANGHDDMNNQLDQANG 958 



+ A + PRA+Y R G K DG A+ +G + A 

Y — EGHGAAGYNGGDVGGNHGAAFNGPRANYPRAGILKNGHNDGRALNRGAFGVFGDNAA 1016 



HG 



AA E AA G+ 
-AAGEAGAAFGN- 



H AN 
-HGAN- 



SSP +GS 



E+ EE 



+D ++ S 



G E S++I 



+G N E N 
-RGNGRGNRNREANGR 1051 



SST 



P++H+Y 
-PILHVY 1133 



EK+F+SE+ CGSLWGVNLLLGT+S+LYLMDRSGKA+I KLI+RRPFRQIQV+E LNLLIT 



ISG KNRLRVYHL+WLRNKILNNDP+SK+RQ+ M K EEACKAIDKL GCEHFSVLQHEE 



TTYIA+A+KSSIHL+AWAPKSFDE+TAIKVFPT D KP+TVDLA+GSEK LKIFFSSA+G 



YH+IDAESEVMS+VTLP 



DCLG+G+ML+ NAEA S EANEQL KK 
- NN VV I L P DCLGLG VML SLN AEAASEEAN EQLLKK 1365 



IL++WKDI PSS+AFECT+R TGW QKAI EVRSLQS +LE+ELKRRSIKKLRFLC RGDK+ 



FF STL N HSRVY M+LGKLEEL +Y V 
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Figure 7: CLUSTAL alignment of INSP081, 082, 091 & published ZC4 coding 
sequences. 



+** = kinase domain 



ZC4 MAGPGGWRDREVTDLGHLPDPTGIFSLDKTIGLGTYGRIYLGLHEKTGAFTAVKVMNARK 

INSP081 MAGPGGWRDREVTDLGHLPDPTGI FSLDKTIGLGTYGRIYLGLHEKTGAFTAVKVMNARK 

INSP082 MAGPGGWRDREVTDLGHLPDPTGI FSLDKTIGLGTYGRIYLGLHEKTGAFTAVKVMNARK 

INSP091 MAGPGGWRDREWDLGHLPDPTGIFSLDKTIGLGTYGRIYLGLHEKTGAFTAVKVMNARK 

** ***^* **•*******#- ***-«r ******* V*-**^ 

ZC4 DEEEDLRTELNLLRKYSFHKNIVSFYGAFFKLSPPG 

INSP081 TPLPE IGRRVRVNKYQKSVGWRYS DEEEDLRTELNLLRKYS FHKN I VS FYGAFFKLS P PG 

INSP082 TPLPE IGRRVRVNKYQKSVGWRYS DEEEDLRTELNLLRKYS FHKN I VS FYGAFFKLS PPG 

INSP091 TPLPE IGRRVRVNKYQKSVGWRYS DEEEDLRTELNLLRKYS FHKN I VS FYGAFFKLS PPG 

* *■ k * * k * * * * k * h k * k * +- k k k *■ k * *■ k -* * * * k * * * * * 4 * k ; * *■ k * * k * * * * * *r * * * * * * * * 

CF3 > 

ZC4 QRHQLWMVMELCAAGSVTDWRMTSNQSLKEDWIAYICREILQGLAHLHAHRVIHRDIKG 
INSP081 QRHQLWMVMELCAAGSVTDWRMTSNQSLKEDWIAYICREILQGLAHLHAHRVIHRDIKG 
INSP082 QRHQLWMVMELCAAGSVTDWRMTSNQSLKEDWIAYICREILQGLAHLHAHRVIHRDIKG 
INSP091 QRHQLWMVMELCAAGSVTDWRMTSNQSLKEDWIAYICREILQGLAHLHAHRVIHRDIKG 

^ CP 4 

ZC4 QNVLLTHNAEVKLVDFGVSAQVSRTNGRRNSFIGTPYWMAPEVIDCDEDPRRSYDYRSDV 
INSP081 QNVLLTHNAEVKLVDFGVSAQVSRTNGRRNSFIGTPYWMAPEVIDCDEDPRRSYDYRSDV 
INSP082 QNVLLTHNAEVKLVDFGVSAQVSRTNGRRNSFIGTPYWMAPEVIDCDEDPRRSYDYRSDV 
INSP091 QNVLLTHNAEVKLVDFGVSAQVSRTNGRRNSFIGTPYWMAPEVIDCDEDPRRSYDYRSDV 

* * * * * * * #■ * * ****** -V * * * * * * * * * * * *■ * * * * * * * *• * * * •* *• * * * * * * ******* * * * 

ZC4 WS VG I T AI EMAEGA P P LCN LQ PLE AL FV I LRE S A PT VK S S GW SRK FHN FMEKCT I KN FL F 

INSP081 WSVGITAIEMAEGAPPLCNLQPLEALFVI LRE SAPTVKSSGWSRK FHN FMEKCT I KNFLF 

INSP082 WSVGITAIEMAEGAPPLCNLQPLEALFVI LRESAPTVKSSGWSRKFHNFMEKCTIKNFLF 

INSP091 WSVGITAIEMAEGAPPLCNLQPLEALFVI LRE SAPTVKSSGWSRK FHN FMEKCT IKNFLF 

kk*kk^k*kk*k^^k^kM*k^*k^kk*k^^k**:k^kk^kkk***ir*ic**k*k**kr****-k 



ZC4 

INSP081 

INSP082 
INSP091 



ZC4 

INSP081 
INSP082 
INSP091 



ZC4 

INSP081 
INSP082 
INSP091 



► CP1 

RPTS ANMLQH P FVR DIKNERHWES LTRHLTG 1 1 KKRQKK 

RPTSANMLQHPFVRDIKNERHWESLTRHLTGIIKKRQKKGIPLIFEREEAIKEQYTVRR 

R PT S ANMLQH P FVR DIKNERHWES LTRHLTG 1 1 KKRQKKG I PL I FEREEA I KEQYTVRR 

R PT SANMLQHP FVR DIKNERHWES LTRHLTG 1 1 KKRQKKG I PL I re REE AIKEQYTVRR 
*****************************.*•*.******* + * 

FRGPSCTHELLRLPTSSRCRPLRVLHGEPSQPRWLPDREEPQVQALQQLQGAARVFMPLQ 
FRGPSCTHELLRLPTSSRCRPLRVLHGEPSQPRWLPDREEPQVQALQQLQGAARVFMPLQ 
FRGPSCTHELLRLPTSSRCRPLRVLHGEPSQPRWLPDREEPQVQALQQLQGAARVFMPLQ 

ALDSAPKPLKGQAQAPQRLQGAARVFMPLQAQVKAKASKPLQMQIKAPPRLRRAARVLMP 
ALDSAPKPLKGQAQAPQRLQGAARVFMPLQAQVKAKASKPLQMQIKAPPRLRRAARVLMP 
ALDSAPKPLKGQAQAPQRLQGAARVFMPLQAQVKAKASKPLQMQIKAPPRLRRAARVLMP 

< CP2 
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ZC4 

INSP081 
INSP082 
INSP091 



LQAQVRAPRLLQVQSQVSKKQQAQTQTSEPQDLDQVPEEFQGQDQVPEQQRQGQAPEQQQ 
LQAQVRAPRLLQVQSQVSKKQQAQTQTSEPQDLDQVPEEFQGQDQVPEQQRQGQAPEQQQ 
LQAQVRAPRLLQVQSQVSKKQQAQTQTSEPQDLDQVPEEFQGQDQVPEQQRQGQAPEQQQ 



ZC4 

INSP081 
INSP082 
INSP09X 



RHNQVPEQELEQNQAPEQPEVQEQAAEPAQAETEAEEPESLRVNAQVFLPLLSQDHHVLL 
RHNQVPEQELEQNQAPEQPEVQEQAAEPAQAETEAEEPESLRVNAQVFLPLLSQDHHVLL 
RHNQVPEQELEQNQAPEQPEVQEQAAEPAQAETEAEEPESLRVNAQVFLPLLSQDHHVLL 



ZC4 

INSP081 
INSP082 
INSP091 



PLHLDTQVLIPVEGQTEGSPQAQAWTLEPPQAIGSVQALIEGLSRDLLRAPNSNNSKPLG 
PLHLDTQVLIPVEGQTEGSPQAQAWTLEPPQAIGSVQALIEGLSRDLLRAPNSNNSKPLG 
PLHLDTQVLIPVEGQTEGSPQAQAWTLEPPQAIGSVQALIEGLSRDLLRAPNSNNSKPLG 



ZC4 

INSP081 
INSP082 
INSP091 



-EQAREKKSKVSTLRQALAKRLS PKRFRAKSSWRPEKLELS DL 

PLQTLMENLSSNRFYSQPEQAREKKSKVSTLRQALAKRLSPKRFRAKSSWRPEKLELSDL 
PLQTLMENLSSNRFYSQPEQAREKKSKVSTLRQALAKRLSPKRFRAKSSWRPEKLELSDL 
PLQTLMENLSSNRFYSQPEQAREKKSKVSTLRQALAKRLSPKRFRAKSSWRPEKLELSDL 



ZC4 

INSP081 
INSP082 
INSP091 



EARRQRRQRRWEDIFNQHEEELRQVDKDKEDESSDNDEVFHSIQAEVQIEPLKPYISNPK 
EARRQRRQRRWEDIFNQHEEELRQVDKDKEDESSDNDEVFHSIQAEVQIEPLKPYISNPK 
EARRQRRQRRWEDIFNQHEEELRQVDKDKEDESSDNDEVFHSIQAEVQIEPLKPYISNPK 
EARRQRRQRRWEDIFNQHEEELRQVDKDKEDESSDNDEVFHSIQAEVQIEPLKPYISNPK 



ZC4 

INSP081 
INSP082 
INSP091 



KI EVQERS PS V PNNQDHAHH VK FS S SVPQRSLLEQAQKPIDIRQRSSQN 

KIEVQERSPSVPNNQDHAHHVKFSSRTWHMLFCLFISVPQRSLLEQAQKPIDIRQRSSQN 
KIEVQERSPSVPNNQDHAHHVKFSSRTWHMLFCLFI SVPQRSLLEQAQKPIDIRQRSSQN 

KI EVQERS P SV PNNQDHAHHVK FS S SVPQRSLLEQAQKPIDIRQRSSQN 

★ a-*********************** ************************ 



ZC4 

INSP081 
INSP082 
INSP091 



RQNWLAASGDSKHKILAGKTQSYCLTIYISEVKKEEFQEGMNQKCQGAQVGLGPEGHCIW 

RQNWLAAS 

RQNWLAAS 

RQNWLAASGDSKHKILAGKTQSYCLTIYISEVKKEEFQEGMNQKCQGAQVGLGPEGHCIW 
******** 



ZC4 

INSP081 
INSP082 
INSP091 



QLGESSSEEESPVTGRRSQSSPPYSTIDQKLLVDIHVPDGFKVGKISPPVYLTNEWVGYN 

ESSSEEESPVTGRRSQSSPPYSTIDQKLLVDIHVPDGFKVGKISPPVYLTNEWVGYN 

ESSSEEESPVTGRRSQSS PPYST I DQKLLVDIHVPDGFKVGKI S PPVYLTNEWVGYN 

QLGE S S SEEES PVTGRRSQS S PPYST I DQKLLVDI HVPDG FKVGKI S PPVYLTNEWVGYN 



ZC4 

INSP081 
INSP082 
INSP091 



ALSEIFRNDWLTPAPVIQPPEEDGDYVELYDASADTDGDDDDESNDTFEDTYDHANGNDD 

ALSEI FRNDWLTPAPVIQPPEEDGDYVELYDASADTDGDDDDESNDTFEDTYDHANGNDD 

ALSEIFRNDWLTPAPVIQPPEEDGDYVELYDASADTDGDDDDESNDTFEDTYDHANGNDD 

ALSEI FRNDWLT PAPVIQPPEEDGDYVELYDASADTDGDDDDESNDTFEDTYDHANGNDD 
* + * + * + + + + ^ + + + + # + + + + + + + + ^ + 



ZC4 

INSP081 
INSP082 
INSP091 



LDNQVDQANDVCKDHDDDNNKFVDDVNNNYYEAPSCPRASYGRDGSCKQDGYDGSRGKEE 
LDNQVDQANDVCKDHDDDNNKFVDDVNNNYYEAPSCPRASYGRDGSCKQDGYDGSRGKEE 
LDNQVDQANDVCKDHDDDNNKFVDDVNNNYYEAPSCPRASYGRDGSCKQDGYDGSRGKEE 
LDNQVDQANDVCKDHDDDNNKFVDDVNNNYYEAPSCPRASYGRDGSCKQDGYDGSRGKEE 



ZC4 

INSP081 
INSP082 
INSP091 



AYRGYGSHTANRSHGGSAASEDNAAIGDQEEHAANIGSERRGSEGDGGKGWRTSEESGA 

AYRGYGSHTANRSHGGSAASEDNAAIGDQEEHAANIGSERRGSEGDGG 

AYRGYGSHTANRSHGGSAASEDNAAIGDQEEHAANIGSERRGSEGDGG 

AYRGYGSHTANRSHGGSAASEDNAAIGDQEEHAANIGSERRGSEGDGGKGVVRTSEESGA 
**+*+*****************+++*********************** 
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ZC4 LGLNGEENCSETDGPGLKRPASQDFEYLQEEPGGGNEASNAIDSGAAPSAPDHESDNKDI 

INSP081 GGNEASNAIDSGAAPSAPDHESDNKDI 

INSP082 GGNEASNAIDSGAAPSAPDHESDNKDI 

INSP091 LGLNGEENCSETDGPGLKRPASQDFEYLQEEPGGGNEASNAIDSGAAPSAPDHESDNKDI 

ZC4 SESSTQSDFSANHSSPSKGSGMSADANFASAILYAGFVEVPEESPKQPSEVNVNPLYVSP 

INSP081 SESSTQSDFSANHSSPSKGSGMSADANFASAI 

INSP082 SESSTQSDFSANHSSPSKGSGMSADANFASAILYAGFVEVPEESPKQPSEVNVNPLYVSP 

INSP091 SESSTQSDFSANHSSPSKGSGMSADANFASAILYAGFVEVPEESPKQPSEVNVNPLYVSP 
******************************** 

ZC4 ACKKPLIHMYEKEFTSEICCGSLWGVNLLLGTRSNLYLMDRSGKADITKLIRRRPFRQIQ 

INSP081 

INSP082 ACKKPLIHMYEKEFTSEICCGSLWGVNLLLGTRSNLYLMDRSGKADITKLIRRRPFRQIQ 

INSP091 ACKKPLIHMYEKEFTSEICCGSLWGVNLLLGTRSNLYLMDRSGKADITKLIRRRPFRQIQ 

ZC4 VLEPLNLLITISGHKNRLRVYHLTWLRNKILNNDPESKRRQEEMLKTEEACKAIDKLTGC 

INSP081 

INSP082 VLEPLNLLITISGHKNRLRVYHLTWLRNKILNNDPESKRRQEEMLKTEEACKAIDKLTGC 

INSP091 VLEPLNLLITISGHKNRLRVYHLTWLRNKILNNDPESKRRQEEMLKTEEACKAIDKLTGC 

2C4 EHFSVLQHEETTYIAIALKSSIHLYAWAPKSFDESTAIKVFPTLDHKPVTVDLAIGSEKR 

INSP081 

INSP082 EHFSVLQHEETTYIAIALKSSIHLYAWAPKSFDESTAIKVFPTLDHKPVTVDLAIGSEKR 

INSP091 EHFSVLQHEETTYIAIALKSSIHLYAWAPKSFDESTAIKVFPTLDHKPVTVDLAIGSEKR 

ZC4 LKI FFSSADGYHLI DAESEVMS DVTLPKNPLEI 1 1 PQNI IILPDCLGIGMMLTFNAEALS 

INSP081 

INSP082 LKIFFSSADGYHLIDAESEVMSDVTLPKNPLEI I I PQNI IILPDCLGIGMMLTFNAEALS 

INSP091 LKI FFSSADGYHLI DAESEVMS DVTLPKNPLEI I I PQNI I ILPDCLGIGMMLT FN AEALS 

ZC4 VEANEQLFKKILEMWKDIPSSIAFECTQRTTGWGQKAIEVRSLQSRVLESELKRRSIKKL 

INSP081 

INSP082 VEANEQLFKKILEMWKDIPSSIAFECTQRTTGWGQKAIEVRSLQSRVLESELKRRSIKKL 

INSP091 VEANEQLFKKILEMWKDIPSSIAFECTQRTTGWGQKAIEVRSLQSRVLESELKRRSIKKL 

ZC4 R FLCT RG DKL FFT S TLRNHH S RVY FMTLGKLEE LQS N Y D V 

INSP081 

INSP082 RFLCTRGDKLFFTSTLRNHHSRVYFMTLGKLEELQSNYDV 

INSP091 RFLCTRGDKLFFTSTLRNHHSRVYFMTLGKLEELQSNYDV 
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Figure 8: Nucleotide sequence with translation of INSP082 PCR product cloned using 
primers INSP082-CP1 and INSP082-CP2. 



1 aggaacagta caccgtgaga agattcagag gaccctcttg cactcacgag cttctgagat 
INSP082-CP1 ^ 

eq ytvr rfr gps cthe llr 

61 tgccaaccag cagcagatgc agaccactta gagtcctgca tggggaaccc tctcagccaa 

1 p t ssrc rpl rvl hgep sqp 

121 ggtggctacc tgatcgagaa gagccacagg tccaggcact tcagcagcta cagggagcag 

r w 1 pdre epq vqa Iqql qga 

181 ccagggtatt catgccactg caggctctgg acagtgcacc taagcctcta aaggggcagg 

a r v fmpl qal dsa pkpl k g q 

241 ctcaggcacc tcaacgacta caaggggcag ctcgggtgtt catgccacta caggctcagg 

aqa pqrl qga arv fmpl qaq 

301 tgaaggctaa agcctctaaa cctctacaaa tgca ga^ttaa ggcacctcca cgact 

INSP082-CP2 

v k a kask plq inqi kapp r 



Position and sense of PCR primers 



2 ®/526? 
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Figure 9: Nucleotide sequence with translation of INSP082 PCR product cloned using 
primers INSP082-CP3 and INSP082-CP4. 



1 


taggaaggcg 


agtgagagt^g 


aataaatatc 


aaaaatctgt 


tgggtggaga 


tacagtgatg 




INSP082-CP3 












g r 


r v r v 


n k y 


q k s 


v g w r 


y s d 


61 


aggaagagga 


tctcaggact 


gaactcaacc 


ttctgaggaa 


gtactctttc 


cacaaaaaca 




e e e 


dirt 


e 1 n 


1 1 r 


k y s f 


h k n 


121 


ttgtgtcctt 


ctatggagca 


tttttcaagc 


tgagtccccc 


tggtcagcgg 


caccaacttt 




i v s 


f y g a 


f f k 


1 s p 


p g q r 


h q 1 


181 


ggatggtgat 


ggagttatgt 


gcagcaggtt 


cggtcactga 


tgtagtgaga 


atgaccagta 




w m v 


in e 1 c 


a a g 


s v t 


d v v r 


m t s 


241 


atcagagttt 


aaaagaagat 


tggattgctt 


atatctgccg 


agaaatcctt 


cagggcttag 




n q s 


iked 


w i a 


y i c 


r e i 1 


q g 1 


301 


ctcaccttca 


cgc/icaccga 


gtaattcacc 


gg 










INSP082-CP4 










a h 1 


h a h r 


v i h 


r 







Position and sense of PCR primers ^ 



